Figure 1: cDNA sequence (and encoded amino acid sequence) o 
Cynomologous monkey Cathepsin S 



1 AGTTGAACTGAAATCTCCCTOCTGCrcCTTTGAAATCITAGAAGAGAGCCCA 
61 AGGATTCTTACTGTAGGAGCACCTGCTGGTTCTATCACAATGAAACAGCTGGTTTGTGTG 

M K Q L V C V 

♦ 

121 ctcttggtgtgctcctctgcagtggcgcagttgc ataaagatcctaccctggatcatcac 

llvcssavaqlhkdptldhh 
181 tggcatctctggaagaaaacctatggcaaacaatacaaggaaaagaatgaagaagcagta 

whlwkktygkqykekneeav 
241 cgacgtctc atctgggaaaag aatctaaagtttgtgatgcttc ataacctggagcattca 

rr;jiweknlkfvmlhnlehs 
301 atgggaatgc actcat atgatctgggc atgaacc acctgggagac atgacc agtgaagaa 

mgmhsydlgmnhlgdmtsee 
361 gtgatgtctttgatgagttccctgagagttcccagccagtggcagagaaatatcacatat 

vmslmsslrvpsqwqrnity 

i 

421 AAGTC AAACGCTAATCAGATATTGCCGGATTCTGTGGACTGGAGAGAGAAAGGGTGTGTT 

KSNANQILPDSVDWREKGCV 
481 ACTGAAGTGAAATATCAAGGTTCTTGTGGTGCTTGCTGGGCTTTC AGTGCTGTGGGGGCC 

TEVKYQGSCGACWAFSAVGA 
541 CTGGAAGCACAGCTGAAGCTGAAAACAGGAAAGCTGGTGTCTCTCAGTGCCCAGAACCTG 

LEAQLKLKTGKLVSLSAQNL 
601 GTGGATTGCTCAACTGAAAAATATGGAAACAAAGGCTGCAATGGTGGCTTCATGACAAGG 

VDCSTEKYGNKGCNGGFMTR 
661 GCTTTCCAGTACATCATTGATAACAACGGCATCGACTCAGACGCTTCCTATCCCTACAAA 

AFQYI IDNNGIDSDASYPYK 
721 GCCACGGATCAGAAGTGTCAATATGACTCAAAATATCGTGCTGCCACATGTTCAAAGTAT 

ATDQKCQYDSKYRAATCSKY 
781 ACTGAACTTCCTTATGGCAGAGAAGATGTCCTGAAAGAAGTTGTGGCCAATAAAGGCCCA 

TELPYGREDVLKEVVANKGP 
841 GTGTCTGTTGGTGTGGATGCGAGTCATCCTTCTTTCTTCCTCTACAGAAGTGGTGTCTAC 

VSVGVDASHPSFFLYRSGVY 
901 TATGAACCATCCTGTACTC AGAATGTGAATC ATGGTGTACTTGTGGTTGGCTATGGTGTT 

YEPSCTQNVNHGVLVVGYGV 
961 CTTAACGGGAAAGAAT ACTGGCTTGTGAAAAAC AGCTGGGGCCGC AACTTTGGTGAAGAA 
LNGKEYWLVKNSWGRNFGEE 
1021 GGATATATTCGGATGGC AAGT^TAAAGGAAATCATTGTGGGATTGCTAGTTTCCCCTCT 

GYIRMARNK GNHCGIASFPS 
1081 TACCCAGAAATCTAGAGAGGATCTCTTCTTTTTATAACAAATCAAGAAAATATGT^GCAC 

Y P E I * SEQ ID N0;2 

1141 TTTCTCTTAACTTAATTTTTCCTGCTGTATCCAGAAGAAATAATTGTGTC ACGATTAATG 
1201 TGTATTTACTGTACTAATTAAAAAAT ATAGTTTGAGGCCGGGCACGGTGGCTC ACGCCTG 
1261 TAATCCCAGTACTTTGGGAGGCCAAGGCAGGCATATCAACTTGAGGCCAGGAGTTAAAGA 
1321 CAGGCCTGGCTAAC ATGGTGAAACCCCGTCTCTACTAAAAATAC AAAAC ATTAGCGGAGC 
1381 GTAATGGTGCATGCCTGTAATCCCAGCTACTTGGGAGGCTGAAGCACAAGATTCCTTGAA 
1441 CCC AAGAGGTTGAGGCTGTGGTGAGCTGAGACCAC ACCACTGTACTCCAGCCTGGACAAC 
1501 AGAGTGGAGACTCTGTTTCAAAAAAACAGAAAAGATAATATAGTTTGATTCTTCGTTTTT 
1561 T AAAATTTGC AAACCTCAGGAAAAAGTTTGCT AAGTAAATTAGTAGTGTACTATAGAT AT 
1621 AACTGTATAAAAATTGTTCAACCTAAAACAATCTGTCATTGTTCATTGTTTTATTTTATA 
1681 CTerTTGTCTTTTTAAGACCCCTGATAGCCTTTTGTAACTTGATGGCTTAAAAGTAC^ 

1741 ATAAATCTGCCATTTCAAATTTCATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA SEQ ID NO:l 



♦ = cleavage site for pre-sequence 
i = cleavage site for pro-sequence 

* = stop codon 



FIGURE 1 



Figure 2: Alignment of cathepsin S amino acid sequences from Cynomologous monkey 
(SEQ ID N0:2, as shown in Fig. 1), human (Genbank accession number BC002642.2), 
monkey sml cells (Saimiri boliviensis)^ dog {Canis familiarise Genbank accession number 
AY156692), and mouse {Mus musculus; Genbank gi3850787), followed by a consensus 
sequence. 



"cyno cats preprotein" = Cynomologous monkey = SEQ ID NO:2 

"smlmonkey cats" = monkey sml cells (Saimiri boliviensis) = SEQ ID NO:3 

"heats prepropro.txt" = human {Homo sapiens) = SEQ ID N0:4 

"dog cats" = dog (Canis familiaris) = SEQ ID N0:5 

"mouse cats prepro" = mouse (Mus musculus) = SEQ ID N0:6 

(consensus sequence = SEQ ID NO:7) 
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Figure 3: Purification of monkey Cathepsin S (Gelcode stained gel) 
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FIGURE 3 BEST AVAILABLE COPY 



